Picture for Zhuoling Li

Zhuoling Li

ToolFG: Towards Well-Grounded Fine-Grained Image Classification

Add code
Jun 01, 2026
Viaarxiv icon

XGRAG: A Graph-Native Framework for Explaining KG-based Retrieval-Augmented Generation

Add code
Apr 27, 2026
Viaarxiv icon

DiffGraph: An Automated Agent-driven Model Merging Framework for In-the-Wild Text-to-Image Generation

Add code
Mar 20, 2026
Viaarxiv icon

Any3D-VLA: Enhancing VLA Robustness via Diverse Point Clouds

Add code
Jan 31, 2026
Viaarxiv icon

Train Once, Deploy Anywhere: Realize Data-Efficient Dynamic Object Manipulation

Add code
Aug 19, 2025
Viaarxiv icon

Bootstrapping Imitation Learning for Long-horizon Manipulation via Hierarchical Data Collection Space

Add code
May 23, 2025
Viaarxiv icon

SceneLLM: Implicit Language Reasoning in LLM for Dynamic Scene Graph Generation

Add code
Dec 15, 2024
Figure 1 for SceneLLM: Implicit Language Reasoning in LLM for Dynamic Scene Graph Generation
Figure 2 for SceneLLM: Implicit Language Reasoning in LLM for Dynamic Scene Graph Generation
Figure 3 for SceneLLM: Implicit Language Reasoning in LLM for Dynamic Scene Graph Generation
Figure 4 for SceneLLM: Implicit Language Reasoning in LLM for Dynamic Scene Graph Generation
Viaarxiv icon

VIRT: Vision Instructed Transformer for Robotic Manipulation

Add code
Oct 09, 2024
Figure 1 for VIRT: Vision Instructed Transformer for Robotic Manipulation
Figure 2 for VIRT: Vision Instructed Transformer for Robotic Manipulation
Figure 3 for VIRT: Vision Instructed Transformer for Robotic Manipulation
Figure 4 for VIRT: Vision Instructed Transformer for Robotic Manipulation
Viaarxiv icon

TranSplat: Generalizable 3D Gaussian Splatting from Sparse Multi-View Images with Transformers

Add code
Aug 25, 2024
Figure 1 for TranSplat: Generalizable 3D Gaussian Splatting from Sparse Multi-View Images with Transformers
Figure 2 for TranSplat: Generalizable 3D Gaussian Splatting from Sparse Multi-View Images with Transformers
Figure 3 for TranSplat: Generalizable 3D Gaussian Splatting from Sparse Multi-View Images with Transformers
Figure 4 for TranSplat: Generalizable 3D Gaussian Splatting from Sparse Multi-View Images with Transformers
Viaarxiv icon

LARM: Large Auto-Regressive Model for Long-Horizon Embodied Intelligence

Add code
May 27, 2024
Figure 1 for LARM: Large Auto-Regressive Model for Long-Horizon Embodied Intelligence
Figure 2 for LARM: Large Auto-Regressive Model for Long-Horizon Embodied Intelligence
Figure 3 for LARM: Large Auto-Regressive Model for Long-Horizon Embodied Intelligence
Figure 4 for LARM: Large Auto-Regressive Model for Long-Horizon Embodied Intelligence
Viaarxiv icon